Universal Character Set - определение. Что такое Universal Character Set
Diclib.com
Словарь ChatGPT
Введите слово или словосочетание на любом языке 👆
Язык:

Перевод и анализ слов искусственным интеллектом ChatGPT

На этой странице Вы можете получить подробный анализ слова или словосочетания, произведенный с помощью лучшей на сегодняшний день технологии искусственного интеллекта:

  • как употребляется слово
  • частота употребления
  • используется оно чаще в устной или письменной речи
  • варианты перевода слова
  • примеры употребления (несколько фраз с переводом)
  • этимология

Что (кто) такое Universal Character Set - определение

TYPE OF CHARACTER ENCODING SCHEME
Variable width encoding; Multi-byte character set; Variable-width; Multi-Byte Character Set; Multibyte character; Multi-byte character; Multibyte character set; Multi Byte Character Set; Multi-byte Character Set; MBCS (encoding); Multi byte character set; MB (character set); Multiple-byte character set; Multiple-Byte Character Set; Multiple Byte Character Set; Multiple-byte Character Set; Multiple byte character set; Multiple-byte character; Mixed Multi-Byte Character Set; PCMB (encoding); Mixed Multi-byte Character Set; Mixed multi-byte character set; Mixed multi byte character set; Mixed Multi Byte Character Set; Mixed Multiple-Byte Character Set; Mixed Multiple-byte Character Set; Mixed multiple-byte character set; Mixed Multiple Byte Character Set; Mixed multiple byte character set
Найдено результатов: 5251
Universal Character Set         
STANDARD SET OF CHARACTERS DEFINED BY THE INTERNATIONAL STANDARD ISO/IEC 10646
UCS-16; UCS-2; ISO 10646; ISO/IEC 10646; IEC 10646; ISO10646; Universal code (typography); ISO-10646; Universal Code (Typography); ISO/IEC 10646-1; List of Unicode entities; Iso 10646-1; ISO/IEC 10646-1:2000; Universal character set; ISO/IEC JTC1/SC2/WG2; ISO/IEC 10646-1:2000(E); Universal Character Set; 10646-1:1993; ISO/CEI 10646; ISO/IEC 10646-2; ISO/CEI 10646-1; ISO/CEI 10646-2; ISO/IEC 10646-1:1993; ISO/IEC 10646-2:2001; ISO/CEI 10646-2:2001; ISO/CEI 10646-1:1993; ISO/CEI 10646-1:2000; ISO/IEC 10646:1993; ISO/IEC 10646:2000; ISO/IEC 10646:2001; ISO/IEC 10646:2003; ISO/IEC 10646:2011; ISO/IEC 10646:2012; ISO/IEC 10646:2014; ISO/CEI 10646:1993; ISO/CEI 10646:2000; ISO/CEI 10646:2001; ISO/CEI 10646:2003; ISO/CEI 10646:2011; ISO/CEI 10646:2012; ISO/CEI 10646:2014
<character, standard> (UCS, ISO/IEC 10646) A 1993 ISO and IEC standard character set, also known as "Universal Multiple-Octet Coded Character Set". UCS comes in a 16-bit variant called UCS-2 and a 32-bit variant called UCS-4, which is composed of 16-bit UCS-2 "planes". So far only one 16-bit plane has been defined, which is known as the Basic Multilingual Plane. The implementation of UCS is still in its infancy, though some moves, such as the Java language defining a character to be 16 bits, are suggestive. [Relationship with Unicode?] (1997-07-04)
Universal Coded Character Set         
STANDARD SET OF CHARACTERS DEFINED BY THE INTERNATIONAL STANDARD ISO/IEC 10646
UCS-16; UCS-2; ISO 10646; ISO/IEC 10646; IEC 10646; ISO10646; Universal code (typography); ISO-10646; Universal Code (Typography); ISO/IEC 10646-1; List of Unicode entities; Iso 10646-1; ISO/IEC 10646-1:2000; Universal character set; ISO/IEC JTC1/SC2/WG2; ISO/IEC 10646-1:2000(E); Universal Character Set; 10646-1:1993; ISO/CEI 10646; ISO/IEC 10646-2; ISO/CEI 10646-1; ISO/CEI 10646-2; ISO/IEC 10646-1:1993; ISO/IEC 10646-2:2001; ISO/CEI 10646-2:2001; ISO/CEI 10646-1:1993; ISO/CEI 10646-1:2000; ISO/IEC 10646:1993; ISO/IEC 10646:2000; ISO/IEC 10646:2001; ISO/IEC 10646:2003; ISO/IEC 10646:2011; ISO/IEC 10646:2012; ISO/IEC 10646:2014; ISO/CEI 10646:1993; ISO/CEI 10646:2000; ISO/CEI 10646:2001; ISO/CEI 10646:2003; ISO/CEI 10646:2011; ISO/CEI 10646:2012; ISO/CEI 10646:2014
The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology — Universal Coded Character Set (UCS) (plus amendments to that standard), which is the basis of many character encodings, improving as characters from previously unrepresented typing systems are added.
ISO 10646         
STANDARD SET OF CHARACTERS DEFINED BY THE INTERNATIONAL STANDARD ISO/IEC 10646
UCS-16; UCS-2; ISO 10646; ISO/IEC 10646; IEC 10646; ISO10646; Universal code (typography); ISO-10646; Universal Code (Typography); ISO/IEC 10646-1; List of Unicode entities; Iso 10646-1; ISO/IEC 10646-1:2000; Universal character set; ISO/IEC JTC1/SC2/WG2; ISO/IEC 10646-1:2000(E); Universal Character Set; 10646-1:1993; ISO/CEI 10646; ISO/IEC 10646-2; ISO/CEI 10646-1; ISO/CEI 10646-2; ISO/IEC 10646-1:1993; ISO/IEC 10646-2:2001; ISO/CEI 10646-2:2001; ISO/CEI 10646-1:1993; ISO/CEI 10646-1:2000; ISO/IEC 10646:1993; ISO/IEC 10646:2000; ISO/IEC 10646:2001; ISO/IEC 10646:2003; ISO/IEC 10646:2011; ISO/IEC 10646:2012; ISO/IEC 10646:2014; ISO/CEI 10646:1993; ISO/CEI 10646:2000; ISO/CEI 10646:2001; ISO/CEI 10646:2003; ISO/CEI 10646:2011; ISO/CEI 10646:2012; ISO/CEI 10646:2014
ISO/IEC 10646-1         
STANDARD SET OF CHARACTERS DEFINED BY THE INTERNATIONAL STANDARD ISO/IEC 10646
UCS-16; UCS-2; ISO 10646; ISO/IEC 10646; IEC 10646; ISO10646; Universal code (typography); ISO-10646; Universal Code (Typography); ISO/IEC 10646-1; List of Unicode entities; Iso 10646-1; ISO/IEC 10646-1:2000; Universal character set; ISO/IEC JTC1/SC2/WG2; ISO/IEC 10646-1:2000(E); Universal Character Set; 10646-1:1993; ISO/CEI 10646; ISO/IEC 10646-2; ISO/CEI 10646-1; ISO/CEI 10646-2; ISO/IEC 10646-1:1993; ISO/IEC 10646-2:2001; ISO/CEI 10646-2:2001; ISO/CEI 10646-1:1993; ISO/CEI 10646-1:2000; ISO/IEC 10646:1993; ISO/IEC 10646:2000; ISO/IEC 10646:2001; ISO/IEC 10646:2003; ISO/IEC 10646:2011; ISO/IEC 10646:2012; ISO/IEC 10646:2014; ISO/CEI 10646:1993; ISO/CEI 10646:2000; ISO/CEI 10646:2001; ISO/CEI 10646:2003; ISO/CEI 10646:2011; ISO/CEI 10646:2012; ISO/CEI 10646:2014
Universal Character Set characters         
  • Apple Chancery]]) shows the synthesized common fraction on the left and the precomposed fraction glyph on the right as a rendering the plain text string "1 1⁄4 1¼". Depending on the text environment, the single string "1 1⁄4" might yield either result, the one on the right through substitution of the fraction sequence with the single precomposed fraction glyph.
  • Apple Chancery]]. This font supplies the text layout software with instructions to synthesize the fraction according to the [[Unicode]] rule described in this section.
  • 3}}<sup>b</sup> Limited.}}
   }}
WIKIMEDIA LIST ARTICLE
Mapping of Unicode characters; List of Unicode ranges; Surrogate pair; Mapping of Unicode blocks; Mapping of Unicode graphic characters; Unicode character; Unicode range; Unicode codepage; Universal Character Set Characters; Unicode characters; High Surrogates; High Private Use Surrogates; Low Surrogates; Universal Character Set character; High PU Surrogates; High Surrogates (Unicode block); High Private Use Surrogates (Unicode block); Low Surrogates (Unicode block); Surrogate mechanism; Surrogate code point; High surrogate; Low surrogate; Surrogate code points; Noncharacter; Surrogates in Unicode; Non-character; ﷐; ﷑; ﷒; ﷓; ﷔; ﷕; ﷖; ﷗; ﷘; ﷙; ﷚; ﷛; ﷜; ﷝; ﷞; ﷟; ﷠; ﷡; ﷢; ﷣; ﷤; ﷥; ﷦; ﷧; ﷨; ﷩; ﷪; ﷫; ﷬; ﷭; ﷮; ﷯; 􏿾; 􏿿; 󿿿; 󿿾; 󯿿; 󯿾; 󟿿; 󟿾; 󏿿; 󏿾; 򿿿; 򿿾; 򯿿; 򯿾; 򟿿; 򟿾; 򏿿; 򏿾; 񿿿; 񿿾; 񯿿; 񯿾; 񟿿; 񟿾; 񏿿; 񏿾; 𿿿; 𿿾; 𯿿; 𯿾; 🿿; 🿾
The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set ( UCS, official designation: ISO/IEC 10646), is an international standard to map characters, discrete symbols used in natural language, mathematics, music, and other domains, to unique machine-readable data values.
Variable-width encoding         
A variable-width encoding is a type of character encoding scheme in which codes of differing lengths are used to encode a character set (a repertoire of symbols) for representation, usually in a computer. Most common variable-width encodings are multibyte encodings, which use varying numbers of bytes (octets) to encode different characters.
DBCS         
CHARACTER ENCODING IN WHICH CHARACTERS ARE ENCODED IN ONE OR TWO BYTES
Double Byte Character Set; Double byte characters; Double-byte character-sets; Double-byte character-set; Extended Graphics Character Set; Double-byte character set; Double-byte character; Double byte character; DBCS (encoding); Double-Byte Character Set; Double-byte Character Set; Double byte character set; Doublebyte character set; Triple-byte character set; TBCS (encoding); Triple Byte Character Set; Triple-Byte Character Set; Triple byte character set; Triplebyte character set; Triple-byte Character Set
Double-Byte Character Set
DBCS         
CHARACTER ENCODING IN WHICH CHARACTERS ARE ENCODED IN ONE OR TWO BYTES
Double Byte Character Set; Double byte characters; Double-byte character-sets; Double-byte character-set; Extended Graphics Character Set; Double-byte character set; Double-byte character; Double byte character; DBCS (encoding); Double-Byte Character Set; Double-byte Character Set; Double byte character set; Doublebyte character set; Triple-byte character set; TBCS (encoding); Triple Byte Character Set; Triple-Byte Character Set; Triple byte character set; Triplebyte character set; Triple-byte Character Set
<character> (IBM) double-byte character set. A character set that uses 16 bits to represent a character. (1995-03-21)
DBCS         
CHARACTER ENCODING IN WHICH CHARACTERS ARE ENCODED IN ONE OR TWO BYTES
Double Byte Character Set; Double byte characters; Double-byte character-sets; Double-byte character-set; Extended Graphics Character Set; Double-byte character set; Double-byte character; Double byte character; DBCS (encoding); Double-Byte Character Set; Double-byte Character Set; Double byte character set; Doublebyte character set; Triple-byte character set; TBCS (encoding); Triple Byte Character Set; Triple-Byte Character Set; Triple byte character set; Triplebyte character set; Triple-byte Character Set
A double-byte character set (DBCS) is a character encoding in which either all characters (including control characters) are encoded in two bytes, or merely every graphic character not representable by an accompanying single-byte character set (SBCS) is encoded in two bytes (Han characters would generally comprise most of these two-byte characters). A DBCS supports national languages that contain many unique characters or symbols (the maximum number of characters that can be represented with one byte is 256 characters, while two bytes can represent up to 65,536 characters).
SBCS         
Single-Byte Character Set; Single byte character set; Singlebyte character set; Single byte character sets; Single-byte character sets; Singlebyte character sets; Single byte character; Single-byte character; Singlebyte character; Single byte characters; Single-byte characters; Singlebyte characters; Single-byte encoding; Single Byte Character Set; Single-byte character set; SBCS (encoding); Single-byte Character Set; SB (character set)
SBCS, or Single Byte Character Set, is used to refer to character encodings that use exactly one byte for each graphic character. An SBCS can accommodate a maximum of 256 symbols, and is useful for scripts that do not have many symbols or accented letters such as the Latin, Greek and Cyrillic scripts used mainly for European languages.

Википедия

Variable-width encoding

A variable-width encoding is a type of character encoding scheme in which codes of differing lengths are used to encode a character set (a repertoire of symbols) for representation, usually in a computer. Most common variable-width encodings are multibyte encodings, which use varying numbers of bytes (octets) to encode different characters. (Some authors, notably in Microsoft documentation, use the term multibyte character set, which is a misnomer, because representation size is an attribute of the encoding, not of the character set.)

Early variable width encodings using less than a byte per character were sometimes used to pack English text into fewer bytes in adventure games for early microcomputers. However disks (which unlike tapes allowed random access allowing text to be loaded on demand), increases in computer memory and general purpose compression algorithms have rendered such tricks largely obsolete.

Multibyte encodings are usually the result of a need to increase the number of characters which can be encoded without breaking backward compatibility with an existing constraint. For example, with one byte (8 bits) per character, one can encode 256 possible characters; in order to encode more than 256 characters, the obvious choice would be to use two or more bytes per encoding unit, two bytes (16 bits) would allow 65,536 possible characters, but such a change would break compatibility with existing systems and therefore might not be feasible at all.